Continuous Performance Analysis of Fault-Tolerant Virtual Machines

Authors

  • Boguslaw Jablkowski
  • Olaf Spinczyk
Abstract

Virtual machine technology has been successfully applied to the construction of fault-tolerant computing systems. For example, VMware Fault Tolerance and Xen Remus support transparent failover of VMs running on different physical machines in a local area network. However, in many application domains high availability alone is not sufficient. Especially in the context of cyber-physical systems, which interact with the physical environment, real-time constraints have to be fulfilled in order to avoid damage. Therefore, we are working on combining VM-based fault tolerance with a performance analysis technique, namely modular performance analysis with real-time calculus. Such an enhanced system would be aware of its own performance at any time and could use this information for smarter reconfiguration decisions in case of faults. This paper sketches the underlying model and the envisioned system architecture, and discusses beneficial application scenarios.

Similar resources

Tardigrade: Leveraging Lightweight Virtual Machines to Easily and Efficiently Construct Fault-Tolerant Services

Many services need to survive machine failures, but designing and deploying fault-tolerant services can be difficult and error-prone. In this work, we present Tardigrade, a system that deploys an existing, unmodified binary as a fault-tolerant service. Tardigrade replicates the service on several machines so that it continues running even when some of them fail. Yet, it keeps the service states...

Towards High-performance and Fault-tolerant Distributed Java Implementations

Java Virtual Machines form an important part of the web and business server market. Distributed Java Virtual Machines have the potential to make a significant contribution to industries that utilize this technology. An attractive platform for this purpose is the cluster, a highly cost-effective and scalable parallel computer model. However, realizing on such a platform a high performance virtua...

A Replication-Based and Fault Tolerant Allocation Algorithm for Cloud Computing

The very large infrastructure of cloud computing systems and the increasing demand for their services create a need for an effective fault-tolerant allocation technique. In this paper, we address the problem of allocating user applications to the virtual machines of cloud computing systems so that failures can be avoided in the presence of faults. We employ job replication as an effective mechanism ...

Improved Fault Tolerant Elastic Scheduling Algorithm for Cloud Computing

The paper focuses on fault tolerance, a long-standing problem in cloud computing, by extending the primary-backup model to include cloud features such as virtualization and elasticity. Fault tolerance is challenging in cloud computing because virtual machines, rather than hosts, are the basic computing instances, which enables virtual machines to migrate to other hosts. The on-demand provisioning of reso...

The Design and Evaluation of a Practical System for Fault-Tolerant Virtual Machines

We have implemented a commercial enterprise-grade system for providing fault-tolerant virtual machines, based on the approach of replicating the execution of a primary virtual machine (VM) via a backup virtual machine on another server. We have designed a complete system in VMware vSphere 4.0 that is easy to use, runs on commodity servers, and typically reduces performance of real applications ...

Publication date: 2012